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Signaling Design of Two-Way MIMO 
Full-Duplex Channel: Optimality Under 
Imperfect Transmit Front-End Chain 

Shuqiao Jia and Behnaam Aazhang, 


Abstract 

We derive the optimal signaling for a multiple input multiple output (MIMO) full-duplex two-way 
channel under the imperfect transmit front-end chain. We characterize the two-way rates of the channel 
by using a game-theoretical approach, where we focus on the Pareto boundary of the achievable rate 
region and Nash equilibia (NE). For a MISO full-duplex two-way channel, we prove that beamforming 
is an optimal transmission strategy which can achieve any point on the Pareto boundary. Furthermore, we 
present a closed-form expression for the optimal beamforming weights. In our numerical examples we 
quantify gains in the achievable rates of the proposed beamforming over the zero-forcing beamforming. 
For a general MIMO full-duplex channel, we establish the existence of NE and present a condition 
for the uniqueness of NE. We then propose an iterative water-filling algorithm which is capable of 
reaching NE. Through simulations the threshold of the self-interference level is found, below which the 
full-duplex NE outperforms the half-duplex TDMA. 

Index Terms 

full duplex two-way channel, MIMO, transmit front-end noise. Beamforming, Pareto boundary, 
Nash equilibrium. 


I. Introduction 

A node in a full-duplex mode can simultaneously transmit and receive in the same frequency 
band. Therefore, the wireless channel between two full-duplex nodes can be bidirectional, having 
the potential to double the spectral efficiency when compared to the half-duplex network. Due to 
the proximity of the transmit and receive antennas on a node, the overwhelming self-interference 
becomes the fundamental challenge in implementing a full-duplex network. The mitigation of the 



DRAFT 


2 


self-interference signal can be managed at each step of the communication network by passive 
and active cancellation methods flT[. In recent work [2}-4], the feasibility of the single input single 
output (SISO) full-duplex communication has been experimentally demonstrated. However, the 
performance is limited by the residual self-interference which is considered in JT] >4-6J to be 
induced by the imperfection of the transmit front-end chain. 

The performance bottleneck from imperfect transmit front-end chain has motivated recent 
research in full-duplex channel with transmit front-end noise. The performance of the SISO full- 
duplex two-way channel has been thoroughly analyzed in dH 13. The multiple input multiple 
output (MIMO) full-duplex two-way channel with transmit front-end noise is considered in 
SHE HI (in (5]|6l termed as MIMO full-duplex bidirectional channel). In 0, the transmit front- 
end noise was modeled as a white Gaussian noise. Following the transmission noise model, 
the effect of time-domain cancellation and spatial-domain suppression on a full-duplex channel 
were studied. In J6], a full-duplex channel was modeled with the transmit front-end noise and 
under the limited dynamic range. The authors then proposed a numerical method to solve the 
signaling that maximizes the lower bound of achievable sum-rate for such a full-duplex channel. 
The maximization of the weighted achievable sum-rate for a full-duplex channel was considered 
under the imperfect transmit front-end chain in [[8I| . 

Within this context, we consider optimally operating a full-duplex channel under imperfect 
front-end chains. We introduce a full-duplex channel model that includes the effect of imper¬ 
fect transmit front-end chain and limited transmitter dynamic range. Such a channel model is 
closely related to a Gaussian interference channel model, which were widely studied in jl9l4TTll. 
Inspired by the work in [3-flT], we consider a full-duplex two-way channel in a game-theoretical 
framework. Consequently, we characterize a full-duplex channel by Pareto boundary and Nash 
equilibrium. In game theory, Pareto boundary is a definition with the global optimality whereas 
Nash equilibrium is with the competitive optimality. Unlike the global optimality, competitive 
optimality is a definition of optimality that can be achieved by distributed algorithms. 

For a MIMO full-duplex channel, the Pareto boundary of the achievable rate region is described 
by a family of non-convex optimization problems. In the special case, where there is only a 
single receive antenna, we can decouple the original non-convex problems to a family of convex 
optimization problems lT2l[ 1~31 . By employing the semi-definite programing (SDP) reformulation, 
we then numerically solve the Pareto-optimal signaling by which the Pareto boundary of a 
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MISO full-duplex channel can be achieved. We further prove that the rank of Pareto-optimal 
signaling is always equal to one. That is to say, for a MISO full-duplex two-way channel, 
transmit beamforming scheme is capable of achieving the entire Pareto boundary. Furthermore, 
we propose a closed-form for the optimal beamforming weights. 

The Pareto boundary of a general MIMO full-duplex channel cannot be decoupled or trans¬ 
formed into a convex form. It implies that, to find the Pareto boundary, a family of centralized 
nonconvex problems needs to be solved, which renders the computation intractable. Therefore, 
for a general MIMO full-duplex channel we restrict our attention to the optimality which can be 
achieved by fully distributed algorithms. In other words, instead of the Pareto boundary, we aim 
to achieve the Nash equilibrium for a general MIMO full-duplex channel. In this paper, we first 
prove the existence of the Nash equilibrium for a MIMO full-duplex channel. We then derive a 
condition to ensure the uniqueness of Nash equilibrium. The signaling at the Nash equilibrium 
can be derived by our proposed algorithm, which is modified from the iterative water-filling 
algorithm (IWFA) in ifTTI . 

The rest of the paper is organized as follows. In Section II the channel model for a MIMO 
two-way full-duplex wireless channel is presented. Section III presents the description of the 
Pareto optimality and the competitive optimality, which correspond to the Pareto boundary and 
the Nash equilibrium, respectively. The characterizations of the Pareto boundary and the Nash 
equilibrium for a full-duplex channel are also provided. In Section IV the Pareto boundary of a 
MISO full-duplex channel is derived, where the beamforming scheme is proved to be opitmal. 
The closed-form solution for the optimal beamforming weights is then presented. Section V 
presents the existence of the Nash equilibrium for a full-duplex channel. The condition for the 
uniqueness of NE is also provided. Here, we propose a modified iterative water-filling algorithm 
to achieve the NE. Numerical examples are provided in Section VI. While the conclusions are 
given in Section VII. 

Notation : We use (•)'•' to denote conjugate transpose. For a scalar a, we use |a| to denote the 
absolute value of a. For a vector a e C Mxl , we use ||a|| to denote the norm, a^ to denote 
the k th element of a, Diag(a) to denote the square diagonal matrix with the elements of vector 
a on the main diagonal. For a matrix A e C MxM , we use A 1 , tr(A) and rank (A) to denote 
the inverse, the trace and the rank of A, respectively. We use diag(A) to denote the diagonal 
matrix with the same diagonal elements as A. A V 0 means that A is a positive semidefinite 
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Fig. 1: The MIMO point-to-point full-duplex network under study. The solid lines denote the 
direct channels and the dashed lines denote the self-interference channels. 

Hermitian matrix. We denote expectation, variance and covariance by E{-}, Var{-} and Cov{-}, 
respectively. Finally, C and El denotes the complex field and the Hermitian symmetric space, 
respectively. 


II. Channel Model 

In this section, we present the channel model for a MIMO full-duplex (FD) network with 
two nodes as illustrated in Fig. [lj We assume that two nodes indexed by i, j e {1, 2} share the 
same single frequency band for transmission. Each node has a transmitter and a receiver. The 
transmitter is equipped with M physical antennas and the receiver with N physical antennas, 
where each single antenna is connected to a front-end chain. The signal from transmitter i is 
collected as the signal of interest by receiver j, j ^ i, while appears at its own receiver i as the 
self-interference signal. 

As illustrated in Fig. |T] the direct channel between two nodes is denoted by yj, 
where r/y represents the average power gain of the direct channel. Similarly, the self-interference 
channel within each node is characterized by the channel matrix Hu and the average power gain 
r\u. According to H, all the channels in the above full-duplex network can be modeled as the 
Raleigh fading channel. That is, all channel matrices are with i.i.d complex Gaussian entries 
with zero mean and unit variance. We define 7* = — to represent the relative strength of the 
direct channel and the self-interference channel. 

While passing through the transmit front-end chain, the intended transmit signal is corrupted 
by distortions in the power amplifier, non-linearities in the DAC and phase noise @[ [5J. The 
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Fig. 2: The MIMO point-to-point full-duplex network under study. The solid line denotes the 
desired channel and the dashed line denotes the self-interference channel. 

results in [|4l El demonstrate that all the impairments induced by the imperfect front-end chain 
can be comprehensively modeled by an additive Gaussian noise, namely, the transmit front-end 
noise. Furthermore, the power of the transmit frond-end noise is (3 times proportional to that of 
the intended transmit signal due to the limited dynamic range of the transmit front-end chain 
flU [5). Here, [3 denotes the noise level of the transmit front-end chain 

Fig. [2] summarizes our full-duplex channel model. The signal at receiver i is given by 

Vi = VWi H ji( s i + e 3 ) + VViiHiiisi + e*) + rii, (1) 

where s* e C Mx 1 denotes the intended transmit signal at transmitter i, the channel matrices 
Hij e C NxAI . The transmit front-end noise e, is propagated over the same channel as s*. Denote 
the covariance of s t by Q i = Cov{sj}. Note that the m th diagonal element of Q t represents 
the transmit signal power of the m th antenna at transmitter i. Thus, e, can be modeled as the 
Gaussian vector with zero mean and covariance Cov{e ): } = /3diag(Q t ) [|6i j8l. The thermal noise 
at receiver i is modeled as n l ~ CAf(0. 1m), where In denotes the N x N identity matrix. 

At receiver i, the signal of interest j ^ 1 is received along with the self-interference 

signal HaSi and the transmit front-end noise H :i ,e v H vl e r . The power level of H jt e :j . Jj =2 % is 
typically much lower than that of the thermal noise n, and thus can be neglected [0. However, 
Hu e t is in the power level close to the signal of interest and needs to be considered for analysis, 
since the power gain of the self-interference channel Hu overwhelms the power gain of the direct 
channel Hji Q. 
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In addition to the strength, transmitters and receivers on a same node are relatively static, 
resulting in the long coherence time of self-interference channels, thus receiver i is assumed 
to have the perfect knowledge of its own self-interference channel Hu Note that receiver 
i also knows its own transmitted signal Then we can eliminate the self-interference H rr s t 
before decoding. The signal after cancellation is given by 


yi jiSj T -\/ijiiH jjCj T rij, (2) 

where Hue, represents the residual self-interference. 


III. Pareto Optimality and Competitive Optimality 

As shown in Q, the transmission from node j to node i is corrupted by the residual self¬ 
interference of node i and the thermal noise. The sum of all such interferences is equal to 
an additive Gaussian noise v t . Let us define S, to be the covariance of v,. then £, =/ + 
Pr)iiHudiag(Q i )H\ i , which is a function of Q r It follows from the results of [ 14, J_5| that by 
employing a Gaussian codebook at node 1, we can achieve the maximum rate for the transmission 
from node 1 to node 2 


RliQl, Q 2 ) = lo § det(I + 7712 ^ 2^2 1 H 12 Q i), (3) 

where (Q 1; Q 2 ) are the transmit covariance matrices of the nodes. Similarly, the maximum rate 
for the transmission from node 2 to node 1 is equal to 

RliQi, Q 2 ) = log det(J + r] 2 iH\ l T l ^ l H 2 iQ 2 ). (4) 


Denote the feasible set of the covariance matrix Q { as A set of (Q } -Q 2 ) is feasible if it sat¬ 
isfies the transmit power constraints P\, P 2 . Thus, we have A,; = [Q : e H M \Q l hO,tr(Q l )<P l } 
Once (Q 1 , Q 2 ) are given, only rate pair (ri,r 2 ) with r i < R] , r 2 < R 2 is achievable for the FD 
channel. Thus, the achievable rate region for the MIMO FD two-way channel with the transmit 
power constraints Pi,P 2 can be described as the following set: 


k = U 


Q 


> , 


(ri,r 2 ) : 

0 < n < RiiQi, Qi) 

0 < r 2 < R 2 iQi, Q 2 ) ^ 


(5) 
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where R\ and R 2 in ([3]) and <|4]), respectively, are mutually coupled by the covariance matrices Q , 
and Qo■ Therefore, there always exists performance tradeoffs between R\ and R 2 in a selection 
of (Q l: Q 2 ). Such tradeoffs can be considered as a game in which node i is player i, R i {Q l , Q 2 ) 
is the payoff of player i and Q i is the admissible strategy of player i. All possible outcomes of 
the game are characterized in the achievable rate region TZ. As a sequence, all concepts of a FD 
channel can be interpreted from a game-theoretic view. Driven by the global optimality, we first 
consider the Pareto boundary for a FD two-way channel. The Pareto boundary is characterized 
by a set of ’jointly’ optimal rate pairs (f?i, R 2 ). Each jointly optimal rate pair is of the Pareto- 
optimality, which is defined as follows (A similar definition can be found in lll6] - fT8lO . 

Definition 1 (Pareto optimality). A rate pair (R\,R 2 ) G TZ is Pareto optimal if there does not 
exist another rate pair (Ri,R 2 ) G TZ such that (Ri,R 2 ) > (R[,R 2 ) and (Ri,R 2 ) (R{,R 2 ) 

where the inequality is component-wise. 

The Pareto boundary refers to the outer boundary of the achievable rate region 1Z in ([5]). Thus, 
we can define the Pareto boundary 7 Z* as follows 

TZ* = l^J {all the Pareto optimal rate pairs (Rfi R 2 ) in TZ}. (6) 

Each point on the Pareto boundary TZ* maximizes one of the weighted sum-rates for the FD 
channel m ed- Therefore, TZ* can be derived by solving a family of weighted sum-rate 
optimization problems: 

max HiRi(QnQ 2 ) + H 2 R 2 (Q 1 ,Q 2 ) 

Q11Q2 ^ 7 ^ 

subject to Q i G Xi, i — 1,2, 
where 0 < /U 1; /i 2 and /ii + n 2 = 1. 

The optimal solutions {Q\- Q* 2 ) for problem ([7]) with some // |, /i 2 correspond to one pair of 
Pareto-optimal transmission strategies for the FD channel. To obtain the entire Pareto boundary, 
we need to derive all Pareto-optimal strategies. However, the centralized non-convex nature of 
problem ([7]) poses two serious issues in achieving the Pareto boundary. First, solving the Pareto- 
optimal strategies in general comes at the price of prohibitively high computationally complexity 
due to the non-convexity of problem (|7|). Second, problem ([7]) is coupled by Q 1 ,Q 2 . Hence, it 
requires an extra central node to acquire the full knowledge of the FD channel and then solve 
the Pareto-optimal strategies. Due to these challenges, it is often not practical to operate a FD 
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channel in its Pareto optimality. An alternative way is that each node would compete for its own 
payoff irrespective of the other node’s payoff. The optimality built in such a scenario is defined 
as follows. 

Definition 2 (competitive optimality). For a two-way FD channel, A) is the nonempty set of 
all feasible strategies for node i. A strategy profile (Q\. QV) G X\ x A 2 is competitive optimal 
if the following condition holds for all i, j G {1.2}, i j: 

Ri{Q*,Q*j) > Ri(Qi,Qj),VQi e (8) 


If the competitive optimality is achieved, any unilateral change of strategies would result in 
a rate loss for the FD channel ifTTtt . From the game theoretic view, a set of competitive optimal 
strategies corresponds to a Nash equilibrium (NE) of the FD channel. To obtain an NE, we 
construct a non-cooperative game according to Definition [2j In the game, node i is assumed to 
have the knowledge of the direct channel H tJ and its own self-interference channel H vj , and have 
a fixed power budget Pj. At each iteration, given the strategy of node j, node i locally chooses 
its strategy Q- to maximize its pay-off R,, which can be described by the rate-maximization 
problem as follows: 


max Ri(Qi, Qj) 

Qi 


(9) 


subject to Q { G Aj, 

In the non-cooperative game, the optimization problem ([9]) is repeatedly done by both nodes 
until an equilibrium is reached, if any. 

If deriving the Pareto boundary straight from problem {7]), one needs to simultaneously search 
Q | and Q > in A, x A 2 . In contrast, the non-cooperative game for an NE is in a fully distributed 
fashion, where each node derives its own Q, from AMore important, problem (|7]) is non-convex 
while problem ([9]) is convex. Accordingly, an NE promises much higher computational efficiency 
than the Pareto boundary. An NE is of the competitive optimality, however, not guaranteed to 
achieve the Pareto optimality. Therefore, the tradeoff between performance and computational 
efficiency should be considered in the strategy design for a FD channel. 

In the sequel, we will investigate further into the transmission strategy design within the 
framework of game theory so as to improve the performance of the two-way FD channel. We 
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will first consider the simple case where each FD node is equipped with single receive antenna, 
and then consider the general MIMO case. 


IV. MISO Full-duplex Channel 


We consider the scenario where all FD nodes are equipped with only one receive antenna 
i.e., N — 1. Consequently, all the channels are reduced to MISO, and can be represented by 
vectors h l;) , i, j e {1, 2}. The maximum rate for the channel from node i to node j can be then 
simplified as 



( 10 ) 


where (Q 1 ,Q 2 ) are the given transmit covariance matrices. In contrast with the general MIMO 
rates in ([3]) and ([4]), the rate for the MISO case in ( fTO] ) is in a simpler form, which then improves 
the efficiency in solving the Pareto boundary. 

A. Decoupled Optimization Problems 

The difficulty in deriving Pareto boundary for the MIMO FD channel is caused by the non¬ 
convexity and the coupled high-dimensional nature of problem (jTj). To render the derivation 
tractable, we need to decouple problem |7|) in terms of lower-dimensional variables. Inspired 
by the decoupling procedure in [T2l 18; [19j] we introduce an auxiliary variable z* to denote 
the power of the received signal at node j i.e., z, : = h\ :j Q t h ir With z t , we then construct the 
following optimization problem of Q, under the transmit power constraint Pf. 

min /it i diag(Q i )/i ii 

subject to hljQjhij = Zi (11) 

tr(Qi) < Pi, Qi h 0 

where i, j e {1, 2} and i =4 j . Here, we require 



so that problem ( [IT] ) always has a feasible solution. We define 1Z as the set of all optimal 
solutions for problem ([IT]). 
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Unlike problem (jTj) where the objective function is regrading Q , and Q 2 , problem <[TT]) depends 
only on Q r . In Lemma [I] we show that the Pareto boundary for the MISO FD channel can be 


alternatively characterized by solving problem (11). 


Lemma 1. For a MISO full-duplex channel with the transmit power constraint IT, any point on 
the Pareto boundary TZ* for the achievable rate region TZ in ([5]) can be achieved by the optimal 
solution Q* for problem (jTT|) with some z*. That is to say, 7 Z* C 7 Z. 


Proof: Denote the optimal value of problem (11) as T* (zf. Then, we can define R in terms 


of Zi and T*(zi ) as follows: 


TZ = 


U 


^ie[o,Pi|!h 12 || 2 ], 

22G[o,p 2 |!^2i|| 2 ] 


(ri,r 2 ) : 

T\ = log 11 + 

r 2 = log 1 + 


V 12 Z 2 


1 + Pvn^ii z i) 

V21Z1 


. 


1 + j3ri 22 T* 2 (z 2 ) 

For any point (Rf R* 2 ) on the Pareto boundary, assume that it is achieved by Q \ and Q* 2 . Q* 
is a feasible solution for problem (11) with z t = z* = h\^Q*hij where i,j G {1, 2} and 1 f j. 
Let i — 1, if Ql is not an optimal solution for problem (11) i.e., fi-^diag (Ql)hn > rj(;^) then 

V2l4 


while 


R*i < log 1 + 


R*2 < log 1 + 


1 + /7r/nFj(^) 


lll2 z l 

1 + ^ 22 Tl(z* 2 ) 


— Ri, 


= r 2 . 


As (7?i, R 2 ) belongs to 1Z and thus belongs to 1Z, R\ < R\ and R 2 < R 2 contradict to the Pareto 
optimality of (RfRf). Therefore Q\ is an optimal solution for problem ( [IT] ). In the same way 
we can show that Q 2 is an optimal solution for problem ( [IT] ). ■ 

We stress that the set 1Z is not necessarily equivalent to the Pareto boundary 1Z*, since TZ 
may include the rate pairs inside the region TZ. However, the relationship TZ* C TZ implies 
that any approach of obtaining the set TZ will suffice to derive the entire Pareto boundary TZ*. 
Furthermore, any result applying to TZ also works for TZ*. Hence, we proceed to explore the 
optimal signaling for the MISO FD two-way channel by the study of the set TZ. 
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B. Optimal Beamforming 

Problem ([IT]) is not a common optimization problem since the objective function includes 


the non-linear operator diag(-). By setting A, = h^h],, Ci = Diag(|/i-^| 2 ,..., \h\l 


AM) | 2 N 


and 


using the equivalent relationship fa^diag (Qfjhu = tr(C,Q ; ), we reformulate problem (11) to 
the semi-definite programming (SDP) problem as follows (See more details about SDP in liT3{| ): 

(13) 


min tr(C,Q ; 
Qi 


subject to tr(AjQJ = z t , e 

where Ci, Ai e H M . The above SDP reformulation reveals the hidden convexity of problem 
( |TT| ) so that we can solve it by employing the well-developed interior-point algorithm within 
polynomial time. Furthermore, we can numerically characterize the Pareto boundary for the 
MISO FD two-way channel in efficiency. 


The optimal solutions for problem (13) determine the signaling structure to achieve the rate 
pairs in the set 1Z. In Theorem [lj we explore the rank of optimal solutions Q* for problem |l 
where i,j e { 1 , 2} and i j. 


Theorem 1. For problem (13) with Pi > 0 and 0 < z % < there always exists an 

optimal solution Q* with rank(Q*) = 1. 


Proof: See Appendix A. ■ 

Note that the transmit signal with the rank-one covariance matrix can be implemented by 
transmitter beamforming. It follows from Theorem |T| that all points in the set 1Z, which include 
the entire Pareto boundary, can be achieved by the transmitter beamforming. Therefore, we 
conclude that transmitter beamforming is an optimal scheme for the MISO FD two-way channel. 
In Lemma [2] we derive the closed-form of the optimal weights for transmitter beamforming. 

Lemma 2. For node i in the MISO point-to point FD wireless network with the transmit 
power constraint Pi and complex channels ha,hij,i,j G {1,2}. /' f j , the optimal beamforming 
weights have the following form: 


\/~zi(C{ T eT) 1 hiq 

w* = V v - - -^ (14) 

h\j(Ci + el)~ l hij 

where C h = Diag(|h.-^| 2 ,..., \ h[f I] \-), constant z t is within the range 0 < z % < / 3 ||fi ZJ j| 2 and I 
denotes the M x M identical matrix. For a fixed z t , nonnegative constant e is adjusted to satisfy 






DRAFT 


12 


the transmit power constraint 11 w,, j | 2 < P 1 . Specially, e = 0 if 


Zi < 


PiVAjCr'h, j ) 2 


(15) 


Proof: The optimal beamforming weights can be obtained by solving problem ( |T3| ) with 
the rank-one constraint Q i = w^w] as follows: 

min w]Ci,Wi 

Wi 

subject to \w\hij\ 2 = 


(16) 


Wi 


<P- 


The above problem has the general closed-form optimal solution ( [T4] ) (see details in [|20ll ). 
Without the transmit power constraint ||iUj|| 2 < P, problem (16) has the following optimal 
solution (shown in 


) 


^fpc-'h 


W: = 


*2 


pc-'h,, 


(17) 


Combining (|T7}) and the condition (15), 


in* II 2 = 


ZjhlCj 2 h ij 


< Pi. 


Hence, we conclude that e = 0 under the condition ( fl5| ). ■ 

We remark that the optimal beamforming weights for node i is closely parallel to the direct 
channel hij, beamforming the signal of interest at node j. While the transmit front-end noise 
corresponding to the stronger self-interference channel is largely suppressed via the matrix (C, + 

el)- 1 - 


V. MIMO Full-duplex Channel 

In the MIMO full-duplex network, i.e., M > 1 and N > 1, there is in general no approach 
to decouple and convexify problem (|7[). To characterize the Pareto boundary, one needs to solve 
problem ([7]) for all possible weights (// 1 . fto). For each pair (// 1 , // 2 ), the optimal solutions for 
problem ([7]) can be obtained by an exhaustive search over (Q 1 ,Q 2 )- But the computational 
complexity of the exhaustive search is prohibitively high since the search is coupled by high¬ 
dimensional Q, and Q 2 . In the numerical methods, such as Gradient Projection, are used 
to improve the computational efficiency. However, any numerical method can not be guaranteed 
to find the global optimum due to the non-convexity of problem ([7]). In addition, problem ([7]) 
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can not be decoupled, implying that an extra central node is required to solve the Pareto-optimal 
solutions. The central node needs to have full knowledge of the FD network, which poses an 
extra difficulty in the implementation of a FD channel. Consequently, for the general MIMO FD 
channel we restrict our attention to the non-cooperative game, by which the FD channel can be 
operated in its competitive optimality. Such a game is convex and in a fully distributed fashion, 
rendering the computation tractable. At each iteration of the game, node i selfishly optimizes 
its own performance by changing its transmit strategy Q r . The objective is to achieve the Nash 
equilibrium, where each node’s transmit strategy is a best response to the other node’s strategy. 

A. Existence of Nash Equilibrium 

To obtain the Nash equilibrium (NE) for a FD channel, node i needs to maximize its rate R, by 
solving problem ([9]). The feasible set of problem ([9]) is X, = {Q i <E H 2 |Q, A 0,tr (Q,) < Pi}. 
We denote the optimal solution of problem (|9j) as B,(Q :J ), where £>,(•) is a function of Qj and 
Bi(-) : Xj X t . If Q- is given, then B, ( Q ■ ) satisfies 

Ri(Bi{Qj), Qj) > Ri(Qi, Qj),VQi e X (18) 

Thus, Bi(-) is called Best-Response function lUTIl . We then construct a mapping $ from the 
Best-Response function £>,(•): 

<f>(Q 1 ,Q 2 ) = (B 1 (Q 2 ),B 2 (Q l )), ( 19 ) 

where : X\ x X 2 >->■ X\ x X 2 . The input and output of $ are two sets of feasible transmission 
strategies for the FD channel, where the output strategy for node i is the best response to the 
input strategy for node j. At the fixed point of <E>, the input strategies are equal to the output 
strategies 

(Qi; Q 2 ) = ^(Qi> ^ 2 )- (20) 

It follows from ( fl8| ) and ( fl9| ) that the competitive optimality in Definition <[2]) is achieved at the 
fixed point. Hence, for a FD channel, a NE is equivalent to a fixed-point of the mapping <F. 
It follows that the NE can be achieved for the FD channel by deriving the fixed-point of the 
mapping $ in ( fl9| ). 

Unlike the Pareto boundary, which is the outer bound of the achievable rate region and thus 
always exists, the existence of Nash equilibrium is not obvious. In Lemma [3] we prove that a 
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FD channel always has a Nash equilibrium, regardless of transmit power constraints and channel 
realizations. 

Lemma 3 (Existence of NE). There always exists at least one Nash equilibrium for any MIMO 
two-way full-duplex channel. That is, the mapping $ in ( |T9| ) has at least one fixed point. 

Proof: See Appendix C. ■ 

Lemma [3] illustrates that any FD channel has at least one Nash equilibrium. Thus, it demon¬ 
strates that the NE can be considered as an applicable performance metric for MIMO FD two-way 
channels. 


B. Uniqueness of Nash Equilibrium 

Unlike the Pareto boundary having infinitely many points, a FD channel need not necessarily 
have multiple Nash equilibria. One example is the MISO FD channel. In the MISO case, the 


Nash equilibrium is achieved by the beamforming matrix Q EE = w^wj, where w, = 




y/Pihij 
hijhij 

Note that (Qi' e , Q 2 v/; ) depend only on the channel matrices and transmit power constraints, 
implying that the Nash equilibrium is unique for the MISO FD channel. It is then natural to ask 
conditions to guarantee the uniqueness of Nash equilibrium in a general MIMO FD channel. We 
denote the rank of matrix H, 3 as r l3 i.e., r l3 = ran k(iT Thus we have r tJ < min(M, N). We 
start by assuming the direct channel matrices are full row-rank matrices i.e., 

= N. In this scenario, the following Lemma [4] offers the sufficient conditions to ensure the 
uniqueness of NE for the FD channel. 

Lemma 4. Assume the direct channel matrices are with full row rank. The 

full-duplex channel is ensured to have a unique Nash equilibrium if 

P (hIh^h^Hu) p 


( 21 ) 


where p(X) denotes the spectral radius of the matrix X. 

Proof: See Appendix D. ■ 

To give the additional physical interpretation of Lemma |4j assume C(x) to be the cumulative 
distribution function (cdf) of p {h\ , HflH^ 1 H w 'j p . Following Lemma 

|4j the Nash equilibrium is guaranteed to be unique with probability Ci’ippjn/f 1 >l\ 17 / 22 )- Due 
to the non-decreasing property of cdf, C (7721 ^ 712 //3 2 / 7117722 ) increases as 7721,7712 increases, or 
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/3, ? 7 n, 7722 decreases. Note that for a FD channel 7721,7712 represent the power gains of the direct 
channels, whereas the strength of the residual self-interference is determined by /3, 7711 , 7722 . Thus, 
one can increase the probability that the FD channel has a unique Nash equilibrium by improving 
the direct channel gain or suppressing the residual self-interference. 

In Lemma |4j the uniqueness of NE is guaranteed by the contractive property of the mapping $ 
with respect to the weighted-maximum norm. However, without the full-rank assumption in the 
above lemma, the contractive property may not hold for the mapping $ even if condition ( |2Tj ) 
is satisfied. As an example, consider a symmetric FD channel with Pi = P 2 = 10, dip 1 / t / 2 1 = 
(3r]22/Vi2 = 1 , where the channel matrices are set as 


H u = H 22 = 


-0.1440 + 0.32037 
-0.4009 + 0.51497 
1.3155 + 0.56947 


-0.6735 - 0.00407 
-0.0351 + 0.61187 
-1.2339 - 0.49027 


( 22 ) 


H 12 = H 21 = 


1.1187 + 0.87947 
0.1281 - 0.39437 
1.5970 + 0.27087 


1.0068 - 0.06457 
0.8477 + 0.32487 
-0.3452 + 2.34507 


(23) 


Note that the mapping $ is a contraction with respect to the weighted-maximum norm only if 
there exists some w = [wi,w 2 \ > 0 such that 


$ [Qi , Q2 ) **■ $ (Qi , Q\ 

V Qi € Xu Q 2 £ X 2 , 


< 


QT\Q ( 2 


Qi\Q\ 


(24) 


where X t = {Q e H 2 |Q + 0, tr(Q) < 10}. Let 


Q? = Q'i’ = 


( 2 ) 


<37 = Q [ ? = 


0.2208 0 
0 9.7792 

0.4832 0 

0 9.5168 


The above set up leads to 

- (Qi\Q { 2 




* [Q\ \Q\ 


(25) 

(26) 

= 0.1804/ min (wi, w 2 ) and 


= 0.1728/ mm/uq, w 2 ), implying that condition (24) is not 


satisfied for any w = [w\,w 2 ] > 0. Hence, the mapping $ is not a contraction. However, 
p (H^H^i H 21 H n) = 0.4657 < 1 and p (Hl 2 H ^2 H^ H 22 ) = 0.4657 < 1, so condition 
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pTj ) is satisfied. The example therefore demonstrates that Lemma [4] is not true without the full 
row-rank constraints on iTi 2 ,i? 2 i- To extent the contractive property of the mapping $ to all 
FD channels, stronger conditions are needed. In Theorem [2j we derive the sufficient condition 
for a general FD channel to have a unique NE. 


Theorem 2. A full-duplex channel has a unique NE if aqa^ < 1, where aq is defined as 


OLi = 


-f> (H'uHjHjH* ) , if rankfff,,) = N, 

P_ 

li 


fHl + PVuPiP ( HliHu ))p( HIH U ) p ( Hj?H ^) , otherwise. 


(27) 


Proof: See Appendix E. ■ 

Using the inequality p(A^XA) < p(A^Y A), where Y y X A 0, we can obtain 

P < p (HlHu) p (HT/HJ 1 ) (28) 

< + (29) 

The above equality demonstrates that the condition in Theorem Q is stronger than the condition 
in Lemma (|4j). 

Theorem [2] can be interpreted from two perspectives. On the one hand, assume that the channel 
matrices are given, Theorem [2] then imposes constraints on (3. to ensure the unique Nash 

equilibrium for the FD channel. In the case that the direct channel gain p l3 is also fixed, aqct 2 < 1 
then indicates how small the residual self-interference must be to guarantee the uniqueness of 
Nash equilibrium. On the other hand, if /3,rjjj,r)ij are given, Theorem [2] then determines the 
probability that the FD channel is guaranteed to have a unique Nash equilibrium. In the following 
Corollary [lj we discuss a special FD channel, where all channel matrices are 

circulant. In such a case, Theorem [2] can be further simplified. 


Corollary 1. Assume that all the matrices i )2 j have circulant structures. The full- 

duplex channel is ensured to have a unique Nash equilibrium if 


|oii(fc)| s 


W22(k)\- 


rnvvi i^ 22 V'vi . 7i72 

max T-rjYfy ■ max ,- —— < (30) 

k=l,...,M \cr- 2 i(k)\ 2 k=l,...,M |ci 2 (/c) | 2 /3 2 

where <Jij(k) represents the k th eigenvalue of H, :r Moreover, {\a rj (k) \ are i.i.d. Rayleigh 


random variables with variance M. 
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The above Corollary implies that a circulant FD channel has a unique Nash equilibrium 

with probability P \ max • max °~ 22 fflL < -P- I • Furthermore, the analytical form 

F J \k=l,...,M l°-2i(fc)l 2 k=l,...,M l CT i2( fc )l 2 P 2 J : 

of the probability P can be derived if the channel matrices are symmetric i.e., H 12 = H>\ and 
H n = H 2 2 , 7i — 72 = 7- For the notational convenience, denote Y (x) as the cdf of the ratio 
A/B, where A and B are two independent Rayleigh random variables. The analytical expression 
of T(x) can be found in lf2TTl . Note that Wij(k)}ff =1 are independent Rayleigh random variables. 
It follows that {\(Tu{k)\/\(yji{k)\}^ = x are independently distributed with cdf Y{x). Under the 
symmetric channel assumption, the probability P thus can be written in terms of r(x), as 
follows. 


P 

= P 

M 


max 


Wn(k)\< 


max 


\°22{k)\- 


< 


7i72 


k=l,...,M \(J 2 i{k)\ 2 k=l,...,M \(7i 2 (k)\ 2 f3 

\vu(k)\ Fy 

max -—— < , / - 

k=l,...,M |cr 2 i(fc)| V p 


k =1 


|o~n(fc)| 

\cr 2 i (k)\ 


< 


1 7 | _ pM 


(31) 

(32) 

(33) 


C. Iterative Water-filling Algorithms 

For a given FD channel, we would like to find the transmit covariance matrices (Q 1 , Q 2 ) that 


achieve the NE under transmit power constraints. Following (20), it is equivalent to obtain a 
fixed-point for the mapping $ in (jT9j). 

We operate a non-cooperative game to obtain the NE for a FD channel. In the game, the 
transmission strategies are iteratively updated by the mapping $ in ( p~9| ). That is, node i changes 
its strategy as BfiQj) at each iteration of update. Note that BfiQj) can be easily obtained 
by applying the water-filling algorithm to problem (|9]) lITTS . Such a non-cooperative game is 
equivalent to implement the iterative water-filling algorithm (IWFA) in a fully distributed fashion 
dim We first assume that the IWFA is synchronous. That means, all nodes adjust their 
transmit covariance matrices simultaneously. Then, the transmission strategies at the k th iteration 
can be written in terms of the strategies at the {k — l) th iteration, 


(Qr,Qr) = *(Qr 


(34) 


Ideally, the IWFA converges to a Nash equilibrium at the I th iteration if the condition (Qi \ Q ( ' 2 ) = 
stQi’.Q®) is satisfied. In practice, however, we set the tolerance as a small positive number 
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S. The stopping criterion of the IWFA is then described as 


< 5 . 


(35) 


To deploy the synchronous IWFA, the synchronization for all nodes is required, which poses an 
extra issue in the implementation of a FD channel. In an enabled FD channel, the synchronization 
may not be available. The nodes may delay some updates and even miss some updates. Thus, in 
order to be robust in such case, we propose an asynchronous version of the IWFA. To describe 
the possible missing updates, we denote the strategies at k th iteration as where 


BiiQ) 1 ' 1 ' 1,1 ), if update at k th iteration is succeeding, 


th 


Q T i m = 


Q 


n 

r(k— 1) 


, if update at k th iteration is missing, 


(36) 


where 0 < r(k) < k. The asynchronous IWFA has the following stopping rule, 


Qf\Q 


r(fc) 

2 




j{k— 1) 
2 


< s. 


(37) 


To investigate the convergence of IWFA in a FD channel, we only need to consider the 
asynchronous IWFA. Since the synchronous IWFA is a special case of the asynchronous IWFA, 
where r{k) = k. V/r. In Lemma [5] the sufficient condition for the convergence of the asyn¬ 
chronous IWFA is derived. 

Lemma 5. If a full-duplex channel satisfies the condition (27) in Theorem (|2]), then the 
asynchronous IWFA can converge to the unique NE from any initially feasible transmit strategies 

(Qf’.Qf). 

Proof: The condition in Theorem [2] guarantees the mapping <F in (19) to be a contraction 
with respect to the weighted-maximum norm. It then follows from Proposition [2] that the FD 
channel has a unique Nash equilibrium. Further, the contractive property w.r.t. ||.||™ can be used 


to guarantee the asynchronous convergence. See the details of the proof in 

It follows from Lemma [5] that the global convergence of the asynchronous IWFA is regardless 
of the initial point. It implies that the unique NE solved by the asynchronous IWFA is globally 
asymptotically stable If22j if the FD channel satisfies the condition in Theorem [2| 
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VI. Numerical Results 
A. Performance of MISO Full-Duplex Channel 


We present the achievable rate regions for the MISO full-duplex two-way channels in Fig. 3a 


where the channels are symmetric i.e., h \ 2 = h 2 \ and hn = h 22 , rjn = 7/22 and rji 2 = 7 / 21 . Each 
node is equipped with M — 3 transmit antennas and single receive antenna with the transmit 
power constraints P 1 — P 2 — 1 . And we have 7, = 77— p u (in dB). Here we have 71 = 72 
due to the assumption of symmetry. For the notational convenience, we use 7 to replace 7 , 
in the sequel. Note that the self-interference channel gain pa can be reduced by the passive 
suppression a, which leads to an increase of 7 . The transmit front-end noise level is fixed 
with (3 = —40dB, where 6 is determined by the impairments of the transmit front-end chain. 
With the analog and digital techniques in 03 [73, such impairments can be partly compensated 
so that (3 can be reduced. In Fig. [3] each colored line represents the Pareto boundary of the 
achievable rate region for the channel with corresponding 7 . We conclude from the numerical 
results that the achievable rate region shrinks as 7 varies from — 20dB to — 60dB. However, the 
FD channel always outperforms than the half-duplex TDMA channel if the optimal beamforming 
is employed. The extreme points A, B of the rate regions on the axes represent the maximum 
rates in the case that only one-way of the two-way channel is working. It follows that the points 
A, B are only determined by the transmit power constraints P,. The ideal MISO FD two-way 
channel sets the outer bound for the achievable rate regions of all channels, doubling that of the 
half-duplex TDMA channel. 

In Fig. [3b] we evaluate the Pareto-boundary of the same FD channels as in Fig. 3a but with 

we illustrate 


(3 = —60 dB. Comparing the channel with the same 7 in Fig. 3a and Fig. 
that the achievable rate region is increased due to reduction of f3. The stars in Fig. |3b| represent 
the Nash equilibria for the FD channels. It can be observed that the Nash equilibrium is not 
Pareto-optimal except for an ideal FD channel. The circles denote the rate pairs achieved by 
ZF beamforming. Note that the circles are below the corresponding Nash equilibria except for 
the ideal FD channel. Thus, we conclude that the Nash equilibrium outperforms than the ZF 
beamforming for a MISO FD channel in presence of the residual self-interference. 

In Fig. [4aJ we geometrically compare the optimal beamforming weights, the NE and the 
zero-forcing beamforming weights. For simplicity, we assume all channels to be 2 x 1 real- 
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(a) The achievable rate regions for the symmetric MISO FD 
two-way channels with /? = —40 dB, ||/1121| = ||/i2i|| = 

Pi = P2 = 1. As plotted for comparison is the half-duplex 
TDMA achievable rate region. 


(b) The achievable rate regions for the symmetric MISO two- 
way FD channel with f 3 = —60 dB, ||A1121| = ||A1211| = 1, 
Pi = P2 = 1. Circles denote the ZF beamforming rates. 
Stars denote the NE. 


Fig. 3: The comparison of the FD optimal beamforming, the Nash equilibrium and the zero¬ 
forcing beamforming 


value vectors. We further assume the channels to be symmetric i.e., hu = h 22 and h v > = h 2 \. 
Assume the transmit power constraints P 2 = P 2 = P. Then all possible beamforming weights 


are contained in the disc with radius \/~P. In Fig. 4a, OF represents the optimal beamforming 
weights by which the rate pair with maximum sum-rate can be achieved. The zero-forcing (ZF) 


beamforming weights is represented by OZ. Note that OZ restricts the transmit signal orthogonal 
to the self-interference channel ha. Compared with the ZF beamforming weights, the optimal 


OF is not orthogonal to h tl but has greater length of projection on the direct channel hij. 
Among all the weights, the Nash equilibrium OM has the greatest length of projection on h^. 
However, the Nash equilibrium is outperformed by the optimal beamforming weights due to the 


larger amount of self-interference generated by OM than OF. We remark that the direction of 
NE coincides with that of the direct channel in the MISO case. However, it is not true for 
a general MIMO FD channel. 


The achievable rate region shown in Fig. 3b can be achieved only if the infinite-length Gaussian 
codebook is employed by all nodes. However, the communication systems in practice always 


choose the discrete-alphabet modulation schemes, such as QPSK or QAM. In Fig. 4b, we show 
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(a) The geometric comparison of the full-duplex optimal (b) The bit error rate of a symmetric full-duplex QPSK 
beamforming, the Nash equilibrium and the zero-forcing system. Here. M = 3, N = 1, P — 1, /3 = —60 dB, 


beamforming. 


7 = —40 dB. 

Fig. 4 


the error rate performance for one of the MISO FD channels in Fig. [3b] where 7 is equal to 
—40dB. We choose QPSK as the modulation scheme and employ the optimal beamforming, 
Nash equilibrium and zero-forcing beamforming for the FD channel respectively. Note that the 
FD channel is chosen to be symmetric, implying that the transmission from node 1 to node 2 
has the same bit error rate as the transmission from node 2 to node 1. Therefore, we only show 
the BER performance for one directional transmission. 


B. Performance of MIMO Full-Duplex Nash Equilibrium 

It follows from Theorem [2] that the uniqueness of NE depends on the channel realizations 
With given /3, ij l3 and r/ n , Theorem [5] then gives the probability that NE is 
guaranteed to be unique. Specially, for a symmetric circulant FD channel, the probability derived 
from Theorem [2] can be analytically expressed in (f33|). In Fig. [5a] we plot the probability in (33) 


as a function of 7 = 71 = 72 . It follows from Lemma [5] that the condition in Theorem [2] suffices 


to guarantee the convergence of asynchronous IWFA. Accordingly, Fig. 5a is an analytical lower 
bound for the probability that IWFA can asynchronously converge to a NE. Moreover, the unique 
NE is globally asymptotically stable if the condition in Theorem [2] is satisfied. Thus, the larger 


probability implies that the FD NE is more stable. Accordingly, Fig. 5a indicates the stability 
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(a) Probability (lower bound) that a FD channel has the 
unique NE for /3 = — 40dB, fj = — 60dB. Here, M — N = 
3. The channel has a circulant symmetric structure. 



(b) Probability that the iterative water-filling algorithm con¬ 
verges to a Nash equilibrium in X steps. Here, M = N = 3, 
P = — 60dB. 


Fig. 5: The performance of the synchronous IWFA 


of operating a FD network at its NE. 

As illustrated in Figj5a| the probability in (33) can be increased by either reducing 13 or 
increasing 7 . For simplicity, assume the direct channel gain is fixed. Then both the reduction of 
/3 and the increase of 7 lead to the mitigation in residual self-interference. It implies that, for 
a FD channel, the stability of its competitive optimal strategies can be improved by mitigating 
the residual self-interference. 


In Fig. 5b we evaluate the computational efficiency of the synchronous iterative water-filling 
algorithm as a function of 7 . We randomly produce 100,000 channel realizations of a 3 x 3 FD 
channel, testing the probability that the synchronous IWFA converges in X steps, which are then 


plotted in Fig. 5b 


The probability that the asynchronous IWFA converges to a Nash equilibrium in X steps 
increases as 7 increases. It implies that the average number of iterations, which is required by 
IWFA to reach a Nash equilibrium, reduces with an increase of 7 . Therefore, the computational 
efficiency of IWFA is improved if the gain of the direct channel over the self-interference channel 
increases. 

In Fig. [ 6 j we compare the performance of FD Nash equilibrium with half-duplex TDMA in 
terms of the achievable sum-rate. Each node is equipped with M — 3 transmit antennas and 
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Fig. 6 : Average achievable sum-rate for a symmetric MIMO two-way FD channel with (3 = 
-60 dB, Pi = P 2 = 10 and M = N = 3. 

N = 3 receive antennas with the transmit power constraints Pi = P 2 = 1. The transmit front-end 
noise level is fixed with (3 = —60dB. For simplicity, the FD channel is assumed to be symmetric 
i.e., H 12 — H 2 i and iTn — H 22 with rj n — rj 22 and r/i 2 — rj 2 \, where H,j ~ CJ\f(0, Imn)- 
The curves are averaged over 1000 independent channel realizations. 

The performances of both FD NE and half-duplex TDMA are improved as the direct channel 
gain r]ij increases. If the self-interference channel gain >j vl is less than 50dB, then the direct 
channel gain rj t j dominants the performance of the FD channel in terms of the achievable rate. 
The achievable sum-rate of FD NE is close to double that of half-duplex TDMA, implying that 
it is nearly optimal to operate at FD NE within the low rju region. Beyond the low //,, region, 
the sum-rate of FD NE linearly decreases with the increase of rj lt . This is due to the fact that the 
residual self-interference leads the performance of the FD channel beyond the low r]a region. In 
contrast, TDMA works in half-duplex mode, thus the performance of TDMA is not affected by 
the self-interference. It follows that the gap between FD NE and half-duplex TDMA decreases 
as r]a increases with a given // );? . Interestingly, NE and TDMA perform evenly at r/„ = 67dB, 
)],, = 69dB and rja = 72dB, if 7 / i? = OdB, = KMB and ij t] = 20dB respectively. It implies 
that the tradeoff between FD NE and half-duplex TDMA is mainly determined by the self¬ 
interference channel gain r/„ : instead of the direct channel gain riij. Moreover, the achievable 
sum-rate of NE is smaller than that of TDMA with regardless of the value of 77 ^ if is beyond 
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72dB. It follows that the NE of the FD two-way channel can bring extra benefits than the half¬ 
duplex TDMA two-way channel only if the self-interference channel gain can be suppressed to 
be lower than 72 dB. 


VII. Conclusion 

We considered the MIMO point-to-point full-duplex wireless network under the imperfect 
transmit front-end chains. The network was modeled by a MIMO two-way full-duplex channel 
with transmit front-end noise, which was then studied from a game-theoretic view. We charac¬ 
terized the Pareto boundary for a MISO two-way full-duplex channel in presence of the transmit 
front-end noise. Using the decoupling technique and SDP reformulation, we proposed a method 
to obtain the entire Pareto boundary by solving a family of convex SDP problems, rather than 
the original non-convex problems. We showed that any rate pair on the Pareto boundary can 
be achieved by the beamforming transmission strategy. Moreover, we provided the closed-form 
solution for the optimal beamforming weights of the MISO full-duplex two-way channel. 

For the general MIMO full-duplex two-way channel, we achieved the competitive optimality 
by establishing the existence and the uniqueness of Nash equilibrium. We then modified the 
classical iterative waterfilling algorithm to efficiently reach the NE for a MIMO two-way full- 
duplex channel. Through our numerical results, we demonstrated that the transmit front-end noise 
level (3 and the direct-to-self-interference channel gain ratio 7 can influence the full-duplex NE 
from the following perspectives. First, the computational efficiency of reaching a full-duplex NE 
improves with the reduction of (3 or the increase of 7 . Second, the stability of operating on full- 
duplex NE improves as (3 decreases or 7 increases. Finally, we found that the self-interference 
channel gain r/„ : = 72dB is the threshold, beyond which the full-duplex NE is outperformed by 
the half-duplex TDMA in achievable rates. 

VIII. Appendix 

A. Proof of Theorem [7] 

We prove Theorem [T] by the primal-dual method. Note that problem ( |T3| ) is feasible and 
bounded. It follows that its dual problem is also feasible and bounded ||T3l . Assume Q* is an 
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optimal solution for problem ( p~3| ). From [|T3ll . problem ( [13] ) has the dual problem as follows: 

(38) 


min AiZj + A 2 P 

Al,A2 


subject to Z = Ci — XiAi — A 2 J h 0. 


where P = tr(Q*). Assume ((Aj,A * 2 ),Z*) are the optimal solutions for ( [38] ) . We denote the 
rank of Q* by r. We assume r > 1. Following that Q* is positive semi-definite, Q* can then 
be written as Q* = W' 1 via the singular-value decomposition where V € C Mxr . 

Next, we consider the following two linear equations defined by A, and V: 

tr (V^AiVX) = 0 

tr(X) = 0 


(39) 


where the unknown matrix X e IF contains r 2 real-valued unknowns, that is, 
real part and f or the imaginary part. 


r{r-\- 1) 


for the 


The linear system (39) must have a non-zero solution, denoted by X*, since it has r~ unknowns 
where r > 2. By decomposing the Hermitian matrix X*, we obtain X* = UT,U\ where U is 
an r dimensional unitary matrix and £ is the diagonal matrix, £ = Diag(ui,..., oy). Without 
loss of generality, we assume [oil > |cr 2 | ■ > |o>|. Non-zero matrix X* has at least one 


non-trivial eigenvalue, thus \a\ > 0. Next, we construct a new matrix as follows: 


Cl 


(40) 


Note that I — X' A 0. It follows that Q* Vj is semi-positive definite. Next, we show that 
is also an optimal solution for problem ( |T3| ). Note that is optimal for problem (13) 
if and only if (Aj, A 2 ), Z*) satisfies the KKT conditions, including the primal feasibility, 
the dual feasibility and the complementarity ll24ll . As ((A*, A 2 ), Z*) is unchanged, the dual 
feasibility is automatically satisfied. Therefore, we need only to prove the primal feasibility and 
the complementarity of Q*i). 

Q*^ is a feasible solution for problem (13), since the following two equations hold for Q' {V) , 
tr (A t Q* {1) ) = tr(A i V(I--X*)V') (41) 

Cl 

1 


tr(Q 


(i)> 


= tr{V ] AiVI) - tr(V ] A,yX*) = z h 

Cl 

= tr(V(J- — X*)V ] ) 

Cl 

= tr(V^VI) - — tr(V^VX*) = P. 

Cl 


(42) 
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To show the complementarity, note that tr (Q*Z*) = tr(' V^Z*V) = 0 and V 1 Z*V A 0 
implies that Z*V = 0. It follows that 


tr (Q* (1) Z*) = tr(V(J- ~^X*)V ] Z*) 


o-i 

= tr((J- — X*)V f Z*V) = 0. 

o-i 


(43) 


Therefore, Q*^ is an optimal solution for problem (|13|). Furthermore, the rank of Q*\ ) is strictly 


r (i) 


smaller than r since rank(Q* 1 ^) = rank(J — 4-X*) < r. 

We can repeat this process as Q *, Q*ip Q( 2 ), • • •, until rank(Q( fc )) < y/2. In other words, the 
rank of the optimal solution can be strictly decreasing to rank(Q( /;j ) < y/2, that is, rank(Q* fc )) = 
1. 


B. Properties of The Best-Response Function B (■) 

Introducing the constant Pi and the matrices {H the best-response function B,(Q 

denotes the optimal solution of the following optimization problem: 


max log det( J + l H ij Q 


Q 


(44) 


subject to Qi e Xi, 

where i,j e {1, 2} and i f j; E* = J + fh) u H,,d\ag(Q i )H] i which is a full-rank square matrix. 
According to [fill Lemma 11.1 and Lemma 11.2], problem (44) can be equivalent to 

Xo(Q 3 )\\f 

subject to Qi e Xi, 


min IQ, 
Qi 


(45) 


-i 


where X t is a non-empty closed convex set; X 0 (Q ; ) = — [VijHlj Ej- Hij) - ,) 

with the positive constant q that can be chosen independent of Q ; and the matrix Pm(h j: ) that 
depends only on the channel matrix H It follows that B, (Q ; ) represents the point in A) which 
is closest to X 0 (Qj). The best-response function £>(•) can then be alternatively interpreted as a 
metric projection which projects the matrix X {j (Q-) onto the set A). 

The interpretation of the function £>(•) as the metric projection implies the non-expansive 
property as follows: 


I bM 


(1) ) 

3 > 


BfQf )|| F < ||X 0 (qS 1) ) - VqS 1} ,qS 2) g C m * m . 


(46) 
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Building on the continuity of the metric projection and using the continuity of X 0 (Q i ) on 
Q : , one can obtain the continuous property of the best-response function B(Q i ) on Q r 


C. Proof of Lemma [J] 

Following ( [20] ), a Nash equilibrium is equivalent to a fixed-point of the mapping $ in (19). 
Thus, we will use the following results in the fixed-point theory from [fill 25] to show the 
existence of NE. 


Proposition 1. Given a mapping T : X i-> X. If X is nonempty, convex and compact subset 
of a finite-dimensional normed vector space and T is a continuous mapping, then T has at least 
one fixed-point. 

Using Proposition [Tj it suffices to show the existence of fixed-points for the water-filling 
mapping $ in ( [T9| ). The water-filling mapping $ in ( p~9] ) is defined on the set X\ x X 2 , where 
Xi = (Qe W M \Q P 0, tr(Q) < Pi}. It follows from the results in [fl3l that X % is a convex 
and compact subset of C M ~ for any P, £ M. Thus, X is nonempty, convex and compact set 
with a finite dimension for any transmit power constraints P\ and P 2 . In Appendix B, We prove 
the continuity of the Best-response function £>,(■) on X r for any given set of channel matrices 
{Hij}ij e u 2 \. It follows that the mapping $ is continuous on X\ x X 2 for any channel realization. 
Lemma [3] is then an immediate result following from Proposition [Tj 


D. Proof of Lemma [T] 


Following the equivalence built in (20), we can instead derive sufficient conditions for the 
uniqueness of fixed-point in mapping <&. To do so, we need to use the following Proposition 
from 


Proposition 2 (Uniqueness of fixed-point). Let T : X i-> X be a mapping defined on a 
finite-dimensional set X. If T is a contraction with respect to some norm || • ||, that is, there 
exists some scalar a £ [0,1) such that 

||T(a:^ 1 ' ) ) — T(x^) || < ajlaT 1 ) — aT 2 - ) ||, Wx^\x^ £ X. (47) 


Then, there exists at most one fixed point of T. 
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To apply Proposition [2] for the mapping $ in ( [T9| ), we need to define a suitable norm on the 
vector space W M x W M in which the mapping $ is defined. Inspired by the work in we 
introduce the weighted-maximum norm, which is defined as: 


(*r,* 2 )||£= max 

ie{l,2} 


IX 


i If 


Wi 


(48) 


where X i: E H M , || • || r is the Frobenius norm and w = [wi,w 2 ] > 0 is any positive vector. 

Next, we focus on the contraction property of the mapping with respect to the weighted- 
maximum norm || • ||™. That is to show the conditions for the existence of a E |0,1). w > 0 
such that 


* (Qff Q^ 1 ’) - * 


< a 


Qi'-Q? 


Qi’-Qf 


(49) 


V (Qi, Q 2 ) E Xi x X 2 . 


For the convenience of the notation, we introduce e, and e# defined as 


e,; = 


Q l 1 ’ - Q 


( 2 ) 


A 

, e $[ — 


bAq 


»(i) 


BAQ 


»( 2 ) 


(50) 


where i,j E {1, 2} and i ^ j. 


Given two sets of strategies ( Q^\ Q !') , ( QY\ Q'A ) E X 1 x X 2 , we can rewrite the left 


(2) /-)(2) 


and right side of (49) in terms of the vector e = [ei, e 2 ] T and e$ = [e^, e# 2 


l e lloo = 


|c*C = 


Q['\q‘E 


Qi\Q? } 


$ (q?, Q?) - * {Q?\ Q® 


where the norm ||cc||™ of a vector x E C 
w > 0. 


Kx 1 


is defined as ||a;||“ = max 

1100 ie{i-K} 


as 

(51) 

(52) 
w ith some 


The inequality in (49) can then be rewritten as 


|e*||£<a||e||” 


(53) 


We start to assume that the direct channels H tJ are full row-rank. We continue to show the 
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expression of a by using the similar procedure in [j9|. First, we have 


e &i = 
< 


Si (Q; l: i - Si (Qf 


X„(Qf) - X a (Qf' 




-1 




T hj ij 1) 


-1 


+ CjPji^H..) 


(diag (q< 1( 


diag Q 


,( 2 ) 


HlH7' 


33 V 


< P I ~ 


< P 


= P 


P7 H n H P H P H „ 

13 


diag (Q:- ] 

Q? ~ Qj 


diag Q 


,( 2 ) 


(54) 

(55) 

(56) 

(57) 


H 33 H U H ' 3 lH .U I ej 


-tn-i; 


where (54) follows from inequality (46); (55) follows from (45); (56) follows from the reverse- 
order law for the generalized inverse i.e., (A* XA)~ l = A -1 X _1 A - ' ifTTIl - which is valid if A 
is full row-rank matrix; (57) follows from the triangle inequality ||AXA^||_p < p(A'A)\\X\\ f . 


Define the matrix S as follows: 


[S]« = 


p 


if 


A3 

0, otherwise 


We then have 


I^C<||5eir <1151 


• e 

oo II 11 oo 


(58) 


(59) 


By setting a = II^IC $ in (M9b is a contraction with respect to the weighted-maximum norm 
if there exists some w > 0 such that ||5||™ < 1. Note that all entries of 5 are non-negative. 
According to ll23l . ||5||™ < 1 for some w > 0 if and only if p(S ) < 1. Finally, the radius of 5 
can be easily obtained 


p(S) = Jp ( P 


7i 


-h\ 2 h-}h^h 22 , , 

72 


(60) 


which leads to the condition (21). 
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E. Proof of Theorem [2] 

The main challenge in extending Lemma 4 to the general full-duplex channel is that the 
reverse-order law for the generalized inverse, which is used in ( |56| ), is valid only under the 
full row-rank assumption of H ir In general, one can only have the inequality (AX A) - 1 ■< 
A ~ 'X ' AA For the sake of simplicity, we first focus on the strictly full-rank case, where 
the direct channel matrices are with full column-rank. Using the notations and 

following the procedure in the proof of Lemma 4, we obtain 


< 


Bi ( Q) 


(i) 


Bi Q 


»( 2 ) 


X 0 (Qf ] )-X 0 (Q 


(2)n 




V 


-1 


n H t 


H; 


-1 


+ CjPM( Hij ) 


- [mAXA 


7i H t y (2 ) _1 m 




- CjPM( Hij ) 


H 




-1 


(61) 


The strictly full column-rank of Hij implies that is deficient in row-rank, thus we cannot 
derive the upper-bound for ( [61] ), as was done in ([56]). Thus, we need to develop an approach 
suitable for deriving the upper-bound in the full column-rank case. To this end, we introduce 
the function PAX) : Q t H> C MxM as follows, 


Pi(X) = H’ / + feHjjXH' H 


-1 


-1 


(62) 


where Qi = {Q G H^ f |tr(Q) = Pi} is a convex set. Since is with the full column-rank 
that PfX) is invertible, which implies that PAX) is continuous on Q, and differentiable on 
the interior of Qi Ii26l . Invoking the mean-value theorem in liTTlI . we obtain that for any given 
X, Y £ Qi there exists $, = aX + /3Y with some a, (3 > 0 and a + (3 = 1 such that 


II Pi{Y) - Pi(X)|| F < - X\\ F , (63) 

where DxPj denotes the derivative of the function PAX) with respect to X, defined by the 
following equation ||271 : 

dvec(Pj) = (D x Pi) dvec(X) + (D^Pf, dvec(X t ). (64) 

The above identity implies that we can derive DxPi by differentiating and vectorizing PfX). 
For the convenience of notation, we denote Rj(X ) — I + Arj J:l H J:l XH) r simplified as Rj. 
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Then, 


dvec(Pj) = vec(dPj) = -vec(P t (dP i 1 )P i ) 

vec ( Pi • d ( H\ R- x H 3 ) ■ P 


■ n 3 


= p Vj jVe c ( P1I It - II n ■ d(X) • H Jij 1 //,,P 


(65) 


t z?-i; 


= p Vjj yec (Gi{X)d(X)Gi(X] 


= /3 Vjj G, (X) ® Gi(X) dvec(X), 


( 66 ) 


where G : (X) = PH]!^ II is a Hermitian matrix; <g> denotes the Kronecker product; (|65[) 


follows from the inversibility of P,(X')\ 


follows from the result in [28]. (64) and (66) 


indicate that D x Pi = /3 (G*(X) <g> Gi(X) ). Invoking (63), we obtain 


l|P*0n - Pi WIIf < /3r]jj\\Gi(€) ® ^(Ollalll" - X\\ F . 

We can further investigate ([66]) by 


(67) 


G,(«)®G.(«)|| 2 = Jp G,(«)®G,(«) G,(£)®G,(£) 


= \/p ((£?,«)' ® G?«)j ® G.(«) 

p((Gf«)G^))®(G;(OGi«) 

pfelKJGj:(€)). 


( 68 ) 


Each step in (68) follows from the property of the Kronecker product. Note that the orthogonal 


projection onto the range of P 1 ^H\- i.e., Ran(P L/Z H]X is defined as 


”!/ 2 TT t 


IJ) 


P it = Rj 112 H jj Pj H k Rj 


t 0-1/2 


(69) 


Then, Gj(£) can be rewritten in terms of Pr as G,(£) = //,'/? Pr II 1,z Hjj. It follows 
that 


-1/2; 


P (G-(£)Gi(£)) = P ' p /f p; -//,/// ; ;p; p /f 11/11,, 

< p (h^Hjj) p (RJ\£)) p (ht ] HtA p (J^.(O) • 


( 70 ) 
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Let X = diag(Q^), Y = diag(Q^). We can continue estimate <|6l|), 

-l 


>(2h 


e® : < 




- a 


= — l|P,(y)-P,(X)|| F 
Vij 

Pm 


< 


Vij 

P'Vjj 

Vij 

PVjj 

Vij 


< jpP \ p H-'H-. 1 )p{R-\i)) p(fl,(£)) II Y - X 


(71) 


(72) 


1 + frjjPjP ( H\ H ))p( //' // ) p ( // // ) ||Q™ - Qf || y (73) 


t w - 1 \ nr»( 1 ) _ r>( 2 )i 


1 + /?i)«i> ( // // ) ) P ( // // ) P ( H H 


n ± jr i jj n 


rt 


L jj"n 




(74) 


where ( [71] ) follows from the definition of -Pj(-) in (62); (72) follows from (67), (68) and (70). 
To obtain ([73]), we use the inequality 


J r< H,«) =< / + ftssPsH^H 


(75) 


By using ( |74| ), we can then derive the sufficient condition of the unique NE for the full column- 
rank case. The derivation is same as in Lemma [4[ 

Next, we extend the sufficient condition to the rank deficient case. In order to use the results 
for the full-rank case, the rank-deficient matrix Hjj should be modified to a lower-dimensional 
full-rank matrix Hjj, where H,j G C ;Vxr with r = rank (Hij). The singular value decomposition 
(SVD) writes as H %3 = U l j'E l] V\ J where U l3 and V t] are semi-unitary matrices; V is 
a basis for the row space of H,j. Define H tJ = II ;jV jj and replace H l3 by H t] . Then, 
the optimization problem in ([9]) is modified to the following lower-dimensional optimization 
problem: 


max log dctf / + 
Qi 


-l 

j 




subject to Q, G Xj, 


(76) 


where X t — < Q j g KT I Qi h 0, tr QJ < Pi > and Hjj is full column-rank matrix. Denote the 


optimal solution of problem (76) as B n (Qj)- Next, by building the equivalence between problem 
(44) and problem (76) we can show that Bj(Qj ) = V ,jB,..\Q j)V , r 
First, for any matrix Q, G X t , there exists some Q t G X r such that Q, = V Thus, 

BjQi) = Vi jQ.'vl,, Qj G Xj. Then, using the identity det(J + AB ) = det(J + BA) we 
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obtain 


del (I + 


= del (I + vyHtjBiiQJHlSr') 

-t ut 


= det(7 + jjijHijVijQi V^H]^) 
= det (/ + ri, j H, J Q,'H\ 1 S" 1 ) 

= det(/ + Ti, j H\ J T.-'H Lj Q"). 


Q l is a feasible solution of problem (76), implying that the maximum of problem (76) is no 
smaller than that of problem (44). On the other hand, V ijB ri (Q OVj- is feasible for problem 


((44j). Thus the maximum of problem ( |76| ) is no greater than that of problem ( |44] ). Therefore, 
problem (44) and problem (76) are equivalent, implying B^QA = VijB ri (Q ■) V|-. 


We now can continue to prove Theorem |2j Similar to the proof of the full-rank case, we have 
e*. = 


(77) 


e. (q?) 

- B, (Qf) 

F 



v,,B r . (Qj 11 ) vt - V 

B 

ijLJri 

(q?) 

vt. 

U 

P(VtVy) 

B n {Q?) 

Br, 

[q?) 

F 

Br. « 

- Br. {Q?\ 

), 




A 

7? 


< — ( 1 + frljjPjP ( // ; //,/ 




Q? - Q? ] 


(78) 


= r-(l +p Vij P lP (H) j H„ 


Is 




where ( |77[ ) follows from the the triangle inequality | AX A'| F < A)\\X\\ f ; (J78J) follows 


from (74). The relationship between H V] and H t] implies that 

p (S.lA:/) = P (h-^h^ 

The rest of the proof can then be completed following the same steps in Lemma 4. 


(79) 
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